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Abstract. The relationship between abstract interpretation and partial 
deduction has received considerable attention and (partial) integrations 
have been proposed starting from both the partial deduction and ab- 
stract interpretation perspectives. In this work we present what we ar- 
gue is the first fully described generic algorithm for efficient and precise 
integration of abstract interpretation and partial deduction. Taking as 
starting point state-of-the-art algorithms for context-sensitive, polyvari- 
ant abstract interpretation and (abstract) partial deduction, we present 
an algorithm which combines the best of both worlds. Key ingredients 
include the accurate success propagation inherent to abstract interpre- 
tation and the powerful program transformations achievable by partial 
deduction. In our algorithm, the calls which appear in the analysis graph 
are not analyzed w.r.t. the original definition of the procedure but w.r.t. 
specialized definitions of these procedures. Such specialized definitions 
are obtained by applying both unfolding and abstract executability. Our 
framework is parametric w.r.t. different control strategies and abstract 
domains. Different combinations of such parameters correspond to exist- 
ing algorithms for program analysis and specialization. Simultaneously, 
our approach opens the door to the efficient computation of strictly more 
precise results than those achievable by each of the individual techniques. 
The algorithm is now one of the key components of the CiaoPP analysis 
and specialization system. 



1 Introduction and Motivation 

The relationship between abstract interpretation [3] and partial evaluation [11] 
has received considerable attention (see for example [5, 7, 2, 17, 10, 12, 21, 23, 6, 
15, 4, 22, 13] and their references). In order to motivate and illustrate our proposal 
for an integration of abstract interpretation and partial evaluation, we use the 
running example of Fig. 1. It is a simple Ciao program which uses Peano's 
arithmetic.^ We use the Ciao assertion language in order to provide precise 
descriptions on the initial call patterns. In our case, the entry declaration is 

^ Rules are written with a unique subscript attached to the head atom (the rule num- 
ber), and a dual subscript (rule number, body position) attached to each body literal. 
We sometimes use this notation for denoting calls to atoms as well. 
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:- module(_, [main/1] , [assertions] ) . 

:- entry main(s (s (s (L) ) ) ,R) : (ground (L) ,var(R)) . 
maini(X,X2) :- f ormulai,i (X,X1) , formulai,2 (X1,X2) . 

formula2 (X,W) :- grovmd2,i (X) ,var2,2 (W) ,tW02,3(T) ,minus2,4 (T,X,X2) ,twice2,6 (X2,W) . 

tWO3(s(s(0))) . 

minus4 (0 ,X,X) . 

minuss (s(X) ,s(Y) ,R) :- minuss^i (X,Y,R) . 
minuse (s(_X) ,0,_R) . 
twicer (X, _Y) :- var7,i(X). 
twicesCX.Y) :- grounds.i (X) , tW8,2(X,Y). 

tWgCO.O) . 

twio (s (X) , s (s (NX) ) ) : - twio,i (X,NX) . 

Fig. 1. Running Example 



used to inform that all calls to the only exported predicate (i.e., main/2) will 
always be of the form <— main(s(s(s(L))), R) with L ground and R a variable. 
The predicate main/2 performs two calls to predicate formula/2, which contains 
mode tests ground (X) and var(W) on its input arguments. A call formula (X,W) 
returns W = {X — 2) x 2. Predicate two/1 returns the natural number 2 in 
Peano's arithmetic. A call minus(A,B,C) returns C = B — A. However, if the 
result becomes a negative number, C is left as a free variable. This indicates 
that the result is not valid. In turn, a call twice (A, B) returns B = A x 2. Prior 
to computing the result, this predicate checks whether A is valid, i.e., not a 
variable, and simply returns a variable otherwise. 

By observing the behaviour of the program it can be seen that for initial 
queries satisfying the entry declaration, all calls to the tests ground2,i (X) and 
var2,2 (W) will definitely succeed, even if we do not know the concrete values of 
variable L at compile time. Also, the calls to grounds, i(X) will succeed, while 
the calls to var7^i(X) will fail. This shows the benefits of (1) exploiting abstract 
inform,a,tion in order to abstractly execute certain atoms, which in turn m,ay al- 
low unfolding of other atoms. However, the use of an abstract domain which 
captures groundness and freeness information will in general not be suSicient to 
determine that in the second execution of formula/2 the tests ground2,i (X) and 
var2,2 (W) will also succeed. The reason is that, on success of minus2,4 (T , X , X2) , 
X2 cannot be guaranteed to be ground since minuse/S succeeds with a free vari- 
able on its third argument position. It can be observed, however, that for all 
calls to minus/3 in executions described by the entry declaration, such third 
clause for minus/3 is useless. It will never contribute to a success of minus/3 
since such predicate is always called with a value greater than zero on its second 
argument. Unfolding can make this explicit by fully unfolding calls to minus/3 
since they are sufficiently instantiated (and as a result the "dangerous" third 
clause is disregarded). It allows concluding that in our particular context, all 
calls to minus/3 succeed with a ground third argument. This shows the impor- 
tance of (2) performing unfolding steps in order to prune away useless branches, 
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which will result in improved success information. By the time execution reaches 
twice2,5(X2,W), we hopefully know that X2 is ground. In order to determine 
that, upon success of twice2,5(X2,W) (and thus on success of f ormulai,i(X,W)), 
W is ground, we need to perform a fixpoint computation. Since, for example, 
the success substitution for formulai_i(X,Xl) is indeed the call substitution 
for formulai,2(Xl,X2), the success of the second test ground2,i (X) (i.e., the 
one reachable from foniiulai_2(Xl,X2)) cannot be established unless we prop- 
agate success substitutions. This illustrates the importance of (3) propagating 
(abstract) success information, performing fixpoint computations when needed, 
which simultaneously will result in an improved unfolding. Finally, whenever we 
call formula (X,W) , W is a variable, a property which cannot be captured if we 
restrict ourselves to downwards-closed domains. This indicates (4) the usefulness 
of having information on non downwards-closed properties. 

Throughout the paper we show that the framework we propose is able to 
eliminate all calls to mode tests ground/ 1 and var/1, and predicates two/1 
and ininus/3 are both fully unfolded and no longer appear in the residual code. 
We have used sharing-freeness as abstract domain instead of one based on, say 
regular types, for two reasons.^ First, to illustrate how non-downwards closed 
information, including frceness and definite independence, can be correctly ex- 
ploited by our algorithm in order to optimize the program, and second, to show 
how unfolding can be of great use in order to improve the accuracy of analyses 
apparently unrelated to partial deduction, such as the classical sharing-freeness. 

Example 1. The results obtained by CiaoPP — which implements abstract inter- 
pretation with specialized definitions — are both the following specialized code 
and an accurate analysis for such program (rules are renamed using the prefix 
sp). 

spjnaini (s(s(s(0))) ,0) . 

spjnain2(s(s(s(s(B)))) ,A) :- sp_tW2,i (B,C) , sp_f ormula2,2 (C, A) . 

Sp_tW2(0,0) . 

sp_tW3(s(A) ,s(s(B))) :- sp.twaa (A,B) . 
sp_formula4(0,s(s(s(s(0))))) . 

sp-formulasCsCA) ,s(s(s(s(s(s(B))))))) :- sp_tW5,i (A,B) . 

In this case, the success information for sp_main(X,X2) guarantees that X2 is 
definitely ground on success. Note that this is equivalent to proving MX > 
3, main{X,X2) — > X2 > 0. Furthermore, our system is able to get to that 
conclusion even if the entry only informs about X being any possible ground 

term and X2 a free variable. 

The above results cannot be achieved unless all four points mentioned before 
are available in a program analysis/specialization system. For example, if we use 

traditional partial deduction [19, 8] (PD) with the corresponding Generalize and 
Unfold rules followed by abstract interpretation and abstract specialization as 

^ The values for the rest of parameters are: AGeneralize and AUnfold rules based on 
homeomorphic embedding [14], and the identity function as Widen-Call function. 
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described in [21,22] we only obtain a comparable program after four iterations 
of the: "PD + abstract interpretation + abstract specialization" cycle. If we 
keep on adding more calls to formula, every time more iterations are necessary 
to obtain results comparable to ours. This shows the importance of achieving 
an algorithm which is able to interleave PD, with abstract interpretation, ex- 
tended with abstract specialization, in order to communicate the accuracy gains 
achieved from one to the other as soon as possible. In any case, iterating over 
"PD + analysis" is not a good idea from the efficiency point of view. Also, some- 
times partially evaluating a partially evaluated program can degrade the quality 
of the residual program. 

The remaining of the paper is organized as follows. Section 2 recalls some 
preliminary concepts. In Sect. 3, we present abstract unfolding which already 
integrates abstract executability. Section 4 introduces our notion of specialized 
definition and embeds it within an abstract partial deducer. In Sect. 5, wc pro- 
pose our scheme for abstract interpretation with specialized definitions. Finally, 
Sect. 6 compares to related work and Sect. 7 concludes. 

2 Preliminaries 

Very briefly (see for example [18] for details), an atom A is a syntactic construc- 
tion of the form p{ti, . . . , where p/n, with n > 0, is a predicate symbol and 
ti, . . . ,tn arc terms. A clause is of the form H ^ B where its head H is an atom 
and its body S is a conjunction of atoms. A definite program is a finite set of 
clauses. A goal (or query) is a conjunction of atoms. 

Let G bo a goal of the form ^ Ai, . . . , An, . . . ,Ak, k > 1. The concept 
of computation rule, denoted by TZ, is used to select an atom within a goal 
for its evaluation. The operational semantics of definite programs is based on 
derivations [18]. Let C = H ^ Bi, . . . , Bm be a renamed apart clause in P such 
that 39 = mgu{AR,H). Then ^ 0{Ai,. . . ,Ar_i,Bi,. . . ,B^,Ar+i,. . . ,Ak) is 
derived from G and C via TZ. As customary, given a program P and a goal 
G, an SLD derivation for P U {G} consists of a possibly infinite sequence G ~ 
Go, Gi, G2, . . . of goals, a sequence Gi, G2, . . . of properly renamed apart clauses 
of P, and a sequence 61,62,- ■■ of mgus such that each Gj+i is derived from 
Gi and Ci+i using A derivation step can be non-deterministic when Ar 

unifies with several clauses in P, giving rise to several possible SLD derivations 
for a given goal. Such SLD derivations can be organized in SLD trees. A finite 
derivation G = Gq , Gi , G2 , . . . , G„ is called successful if G„ is empty. In that 
case 9 = 9\92 ■ ■ - dn is called the computed answer for goal G. Such a derivation 
is called failed if it is not possible to perform a derivation step with G„. Given 
an atom A, an unfolding rule [19, 8] computes a set of finite SLD derivations 
Di,. . . , Dn (i.e., a possibly incomplete SLD tree) of the form Di = A,...,Gi 
with computed answer substitution diiovi = 1, . . . , n whose associated resultants 
(or residual rules) arc 9i{A) ^ Gi. 

The following standard operations are used in the paper to handle keyed- 
tables: Create_Table(T) initializes a table T. \nsert{T, Key , Info) adds Info as- 
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sociated to Key to T and deletes previous information associated to Key, if 
any. \s\n{T, Key) returns true iff Key is currently stored in the table. Finally, 
Look_up(T, Key) returns the information associated to Key in T. For simplicity, 
we sometimes consider tables as sets and we use the notation {Key Info) e T 
to denote that there is an entry in the table T with the corresponding Key and 
associated Info. 

2.1 Abstract Interpretation 

Abstract interpretation [3] provides a general formal framework for computing 
safe approximations of programs behaviour. Programs are interpreted using val- 
ues in an abstract domain (Da) instead of the concrete dom,ain [D). The set of 
all possible abstract values which represents Da is usually a complete lattice or 
cpo which is ascending chain finite. Values in the abstract domain (-Da, C) and 
sets of values in the concrete domain (2^, C) are related via a pair of monotonic 
mappings (a, 7): the abstraction function a : 2^ — > Da which assigns to each 
(possibly infinite) set of concrete values an abstract value, and the concretization 
function 7 : D^ 2^ which assigns to each abstract value the (possibly infinite) 
set of concrete values it represents. The operations on the abstract domain Dq, 
that we will use in our algorithms are: 

— Arestrict(A, S) performs the abstract restriction (or projection) of a substi- 
tution A to the set of variables in the expression E, denoted vars(E); 

— Aextend(A, E) extends the substitution A to the variables in the set vars{E); 

— Aun\f{ti,t2, A) obtains the description which results from adding the abstrac- 
tion of the unification ti = ^2 to the substitution A; 

— Aconj(Ai,A2) performs the abstract conjunction (n) of two substitutions; 

— Alub(Ai, A2) performs the abstract disjunction (U) of two substitutions. 

In our algorithms we also use Atranslate(A : CP,II <— B) which adapts and 

projects the information in an abstract atom A : CP to the variables in the clause 
C = H ^ B. An abstract atom of the form G : CP is a concrete atom G which 
comes equipped with an abstract substitution CP which is defined over vars{G) 
and provides additional information on the context in which the atom will be 
executed at run-time. Atranslate can be defined in terms of the operations above 
as: Atranslate(A : CP,H ^ B) = Arestrict(Aunif(^, il, Aextend(CP, C)), C). Fi- 
nally, the most general substitution is represented as T, and the least general 
(empty) substitution as _L. 

3 Unfolding with Abstract Substitutions 

We now present an extension of SLD semantics which exploits abstract informa- 
tion. This will provide the means to overcome difficulties (1) and (2) introduced 
in Section 1 . The extended semantics handles abstract goals of the form G : CP, 
i.e., a concrete goal G comes equipped with an abstract substitution CP defined 
over vars{G). The first rule corresponds to a derivation step. 
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Definition 1 (derivation step). Let G : CP be an abstract goal where G =<— 
Ai, . . . , Afi, . . . , Afc. Let TZ be a computation rule and let Tl{G) =Ap. Let C = 
H <— Bi, . . . , Bjn be a renamed apart clause in P. Then the abstract goal G' : CP' 
is derived from G : CP and C via IZ if39 = mgu{Aji, H) A CP„ ±, where: 

CPu = Aunif(Afl, HO, Aextend(CP, C6)) 
G' = 9{Ai, . . . , Aii_i,Bi, . . . , Bm, ^H+i, • • • , Ak) 
CP' = Arestrict(CP„, uars(G')) 

An important difference between tlie above definition and the standard deriva- 
tion step is ttiat tlie use of abstract (call) substitutions allows imposing further 
conditions for performing derivation steps, in particular, CP„ cannot be _L. This 
is because if CP 7^ _L and CP„ = _L then the head of the clause C is incom- 
patible with CP and the unification Ar = H will definitely fail at run-time. 
Thus, abstract information allows us to remove useless clauses from the residual 
program. This produces more efficient resultants and increases the accuracy of 
analysis for the residual code. 

Example 2. Consider the abstract atom f ormula(s*(X), X2) : {X/G, X2/V}. which 
appears in the analysis of our running example (c.f. Fig. 2). We abbreviate as 
s°(X) the successive application of n functors s to variable X. The notation X/G 
(resp. X/V) indicates that variable X is ground (resp. a free variable). After ap- 
plying a derivation step, we obtain the derived abstract goal: 

ground(s*(X)), var(X2), two(T), minus(T, s*(X), X2'), twice(X2', X2) : {X/G, X2/V, T/V, X27V} 

where the abstract description has been extended with updated information 
about the freeness of the newly introduced variables. In particular, both T and 
X2 ' are V. 

The second rule we present makes use of the availability of abstract sub- 
stitutions to perform abstract executability [21] during resolution. This allows 
replacing some atoms with simpler ones, and, in particular, with the predefined 
atoms true and false, provided certain conditions hold. We assume the existence 
of a predefined abstract executability table which contains entries of the form 
T : CP ^ T' which specify the behaviour of external procedures: builtins, li- 
braries, and other user modules. For instance, for predicate ground contains the 
information ground(X) : {X/G} true. For var, it contains var(X) : {X/V} 
true.^ 

Definition 2 (abstract execution). Let G : CP be an abstract goal where 
G =^ Ai, . . . , An, . . . ,Ak. Let TZ be a computation rule and let TZ{G) =Afi. 
Let (T : CPr T') be a renamed apart entry in the abstract executability table. 
Then, the goal G' : CP' is abstractly executed from G : CP and (T : CPr T') 
via TZif Ar = e{T) and CPa E CPt, where 

C = Ai, . . . , 0(r'), ...,Ak 

^ In CiaoPP we use assertions to express such information in a domain-independent 
manner. 



Generic Framework for Analysis and Specialization of Logic Programs 



7 



CP' = Arestrict(CP,G') 
CPa = Atranslate(Afl : CP,T ^ true) 

Example 3. From the derived goal in Ex. 2, we can apply twice the above rule 
to abstractly execute the calls to ground and var and obtain: 

two(T),ininus(T,s*(X),X2'),twice(X2',X2) : {X/G, X2/V, T/V, X27v} 

since both calls succeed by using the abstract executability table described above 
and the information in the abstract substitution. 

Definition 3 (AUnfold). Let A : CP be an abstract atom and P a program. 
We define AUnfold{P. A : CP) as the set of resultants associated, to a, finite 
(possibly incomplete) SLD tree computed by applying tfie rules of Definitions 1 
and 2 to A: CP. 

The so-called local control of PD ensures the termination of the above process. 

For this purpose, the unfolding rule must incorporate some mechanism to stop 
the construction of SLD derivations (we refer to [14] for details). 

Example 4- Consider an unfolding rule AUnfold based on homeomorphic em- 
bedding [14] to ensure termination and the initial goal in Ex. 2. The derivation 
continuing from Ex. 3 performs several additional derivation steps and abstract 
executions and branches (we do not include them due to space limitations and 
also because it is well understood). The following resultants are obtained from 
the resulting tree: 

forinula(s(s(s(s(0) ,s(s(s(s(0))))) . 

formula(s(s(s(s(s(A))))) ,s(s(s(s(s(s(B))))))) :- tw(A,B) 

which will later be filtered and renamed resulting in rules 4 and 5 of Ex. 1. 

It is important to note that SLD resolution with abstract substitiitions is not 
restricted to the left-to-right computation rule. However, it is well-known that 
non-leftmost derivation steps can produce incorrect results if the goal contains 
impure atoms to the left of An. More details can be found, e.g., in [16]. Also, 
abstract execution of non-leftmost atoms can be incorrect if the abstract domain 
used captures properties which are not downwards closed. A simple solution is to 
only allow leftmost abstract execution for non-downwards closed domains (and 
non- leftmost for derivation steps). 

4 Specialized Definitions 

We now define an Abstract Partial Deduction (APD) algorithm whose execu- 
tion can later be interleaved in a seamless way with a state-of-the-art abstract 
interpreter. For this it is essential that the APD process can generate residual 
code online. Thus, we need to produce a residual, specialized definition for a 
call pattern as soon as we finish processing it. This will make it possible for the 
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Algorithm 1 Abstract Partial Deduction with Specialized Definitions 

1: procedure partial_evaluation_with_spec_defs(P, {Ai : CPi, . . . ,An ■ CP„}) 
2: Create_Table(gT); Create_Table(5T) 
3: for j = l..n do 

4: PROCESS_CALL_PATTERN(Aj : CPj) 

5: procedure process_call_pattern(A : CP) 

6: if not \s\n(gT,A : CP) then 

7: {A\,A'^) ^ specialized_definition(P, ^ : CP) 

8: Ai : CPi ^ Look_up(gT, A : CP) 

9: for all ren. apart clause Ck = Hk <^ Bk £ P s.t. Hk unifies with A'l do 

10: CPk ^ Atranslate(yli : CPi,Ck) 

11: PROCESS_CLAUSE(C'Pfc , Sfc) 

12: procedure process_CLAUSe(CP, B) 

13: if B = (L, P) then 

14: CPl ^ Arestrict(CP, L) 

15: process_call_pattern(L : CPl) 

16: pr,ocess_clause(CP, R) 

17" else 

is! CPs ^ Arestrict(CP, B) 

19: process_call_pattern(S : CPs) 

20: function specialized_definition(P, A : CP) 

21: A' : CP' ^ AGeneralize{ST, A : CP) 

22: lnsert(gT, A : CP, A' : CP') 

23: if lsln(<ST,yl' : CP') then 

24: A" •^Look_up(<ST, A' : CP') 

25" else 

26! Def ^ A Unfold{P, A' : CP') 

27: A" 4- newJilter(^') 

28: \nsert{ST,A' : CP',A") 

29: Def ^ {{H' ^ B) \ {H ^ B) € Def A H' = ren{H, {A' /A"})} 

30: P^PUDef 

31: return (,!'. ,1") 



analysis algorithm to have access to the improved definition. This may increase 
the accuracy of the analyzer and addresses the difficulty (2) described in Sect. 1. 

Typically, PD is presented as an iterative process in which partial evalua- 
tions are computed for the new generated atoms until they cover all calls which 
can appear in the execution of the residual program. This is formally known 
as the dosedness condition of PD [19]. In order to ensure termination of this 
global process, the so-called global control defines a AGeneralize operator (see 
[14]) which guarantees that the number of SLD trees computed is kept finite, 
i.e., it ensures the finiteness of the set of atoms for which partial evaluation is 
produced. However, the residual program is not generated until such iterative 
process terminates. 

Algorithm 1 presents an APD algorithm. The main difference with standard 
algorithms is that the resultants computed by AUnfold (L26) are added to the 
program during execution of the algorithm (L30) rather than in a later code 
generation phase. In order to avoid conflicts among the new clauses and the 
original ones, clauses for specialized definitions are renamed with a fresh pred- 
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icate name (L29) prior to adding them to the program (L30). The algorithm 
uses two global data structures. The specialization table contains entries of the 
form A : CP ^ A'. The atom A' provides the link with the clauses of the spe- 
cialized definition for A : CP. The generalization table stores the results of the 
A Generalize function and contains entries A : CP A' : CP where A' : CP' is 
a generalization of A : CP. 

Computation is initiated by procedure PARTlAL_EVALUATlON_WlTH_SPEC_DEFS 
(Ll-4) which initializes the tables and calls process_call_pattern for each 
abstract atom Ai : CPi in the initial sot to be partially evaluated. The task 
of PROCESS_CALL_PATTERN is, if the atom has not been processed yet (L6), to 
compute a specialized definition for it (L7) and then process all clauses in its 
spcciahzcd definition by means of calls to PROCESS_CLAUSE (L9-11). Procedure 
PROCESS_CLAUSE traverses clause bodies, processing their corresponding atoms 
by means of calls to process_call_pattern, in a depth-first, left-to-right fash- 
ion. The order in which pending call patterns (atoms) are handled by the algo- 
rithm is usually not fixed in PD algorithms. They are often all put together in a 
set. The reason for this presentation is to be as close as possible to our analysis 
algorithm which enforces a depth- first, left-to- right traversal of program clauses. 
In this regard, the relevant point to note is that this algorithm does not per- 
form success propagation yet (difhculty 3). In L16, it becomes apparent that the 
atom(s) in R will be analyzed with the same call pattern CP as L, which is to 
their left in the clause. This, on one hand, may clearly lead to substantial pre- 
cision loss. For instance, the abstract pattern f ormula(C, A) : {C/G,C/V} which 
is necessary to obtain the last two resultants of Ex. 1 cannot be obtained with 
this algorithm. In particular, we cannot infer the groundness of C which, in turn, 
prevents us from abstractly executing the next call to ground and, thus, from 
obtaining this optimal specialization. On the other hand, this lack of success 
propagation makes it difficult or even impossible to work with non downwards 
closed domains, since CP may contain information which holds before execution 
of the leftmost atom L but which can be uncertain or even false after that. In 
fact, in our example CP contains the info C/V, which becomes false after execu- 
tion of tw(B,C), since now C is ground. This problem is solved in the algorithm 
wc present in the next section, where analysis information flows from left to 
right, adding more precise information and eliminating information which is no 
longer safe or even definitely wrong. 

For the integration we propose, the most relevant part of the algorithm com- 
prises L20-31, as it is the code fragment which is directly executed from our 
abstract interpreter. The remaining procedures (L1-L19) will be overridden by 
more accurate ones later. The procedure of interest is specialized .definition. 
As it is customary, it performs (L21) a generalization of the call A : CP using 
the abstract counterpart of the Generalize operator, denoted by AGeneralize, 
and which is in charge of ensuring termination at the global level. The result of 
the generalization. A' : CP' , is inserted in the generalization table QT (L22). 
Correctness of the algorithm requires that A : CP C A' : CP'. If A' : CP' has 
been previously treated (L23), then its specialized definition A" is looked up in 
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ST (L24) and returned. Otherwise, a specialized definition Def is computed for 
it by using the AUnfold operator of Def. 3 (L26). As aheady mentioned, the spe- 
ciahzcd definition Def for the abstract atom A : CP is used to extend the original 
program P. First, the atom A' is renamed by using newJilter which returns an 
atom with a fresh predicate name, A" , and optionally filters constants out (L27). 
Then, function ren is applied to rename the clause heads using atom A' (L29). 
rQr\{A,{B / B'}) returns 0{B') where 6 = mgu{A,B). Finally, the program P is 
extended with the new, renamed specialized definition, Def'. 

Example 5. Three calls to specialized_definition appear (within an oval box) 
during the analysis of our running example in Fig. 2 from the following abstract 
atoms, first main(s^(X), X2) : {X/G,X2/V}, then tw(B, C) : {B/G,C/V} and finally 
f (C, A) : {C/G, C/V}. The output of such executions is used later (with the proper 

renaming) to produce the resultants in Ex. 1. For instance, the second clause 
obtained from the first call to SPECIALIZED_definition is 

sp_main2(s(s(s(s(B)))) ,A) :- tW2,i(B,C) ,formula2,2(C,A) . 

where only the head is renamed. The renaming of the body literals is done in a 
later code generation phase As already mentioned, Alg. 1 is not able to obtain 
such abstract atoms due to the absence of success propagation. 



5 Abstract Interpretation with Specialized Definitions 

Wc now present our final algorithm for abstract interpretation with specialized 
definitions. This algorithm extends both the APD Algorithm 1 and the abstract 
interpretation algorithms in [20,9]. W.r.t. Algorithm 1, the main improvement 
is the addition of success propagation. Unfortunately, this requires computing 
a global fixpoint. It is an important objective for us to be able to compute an 
accurate fixpoint in an efficient way. W.r.t the algorithms in [20,9], the main 
improvements are the following. (1) It deals directly with non-normalized pro- 
grams. This point, which does not seem very relevant in a pure analysis system, 
becomes crucial when combined with a specialization system in order to profit 
from constants propagated by unfolding. (2) It incorporates a hardwired effi- 
cient graph traversal strategy which eliminates the need for maintaining priority 
queues explicitly [9]. (3) The algorithm includes a widening operation for calls, 
Widen_Call, which limits the amount of multivariancc in order to keep finite 
the number of call patterns analyzed. This is required in order to be able to 
use abstract domains which are infinite, such as regular types. (4) It also in- 
cludes a number of simplifications to facilitate understanding, such as the use of 
the keyed-table ADT, which we assume encapsulates proper renaming apart of 
variables and the application of renaming transformations when needed. 

In order to compute and propagate success substitutions. Algorithm 2 com- 
putes a program analysis graph in a similar fashion as state of the art analyzers 
such as the CiaoPP analyzer [20, 9]. For instance, the analysis graph computed 
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^SPEC-DEF(maln(s^(X),X2) : {X/G, X2/V})j 

Diain(s^(b)', 0); main(s*(B), A); 

□ {B/G,C/V}^^^g^(,^{B/G,C/G} ^ {C/G,A/V}f|-(,^ j^-j{C/G,A/G} 

SPECJ)EF(tw(B,C) : {8/^, C/V}) j ^'"'•^^ SPECJ)EF(f (C, A) : {C/G,A/V}) 



□ {B/G,C/V}^^^g^^-j{B/G,C/G} □ {A/G,B/V}^^|-j^^g-j{A/G,B/G} 



Fig. 2. Analysis Graph computed by ABS.int.with.SPECJ3EF 



by Algorithm 2 for our running example is depicted in Fig. 2. The graph has 
two sorts of nodes. Those which correspond to atoms are called "OR- nodes". 
For instance, the node W(^,^2/w}^g^^^^^3^y^-^ y^2)i^/°'^^/°} indicates that when the 
atom inain(s^(X), X2) is called with description {X/G,X2/V} the answer (or suc- 
cess) substitution computed is {X/G, X2/G}. Those nodes which correspond to 
rules are called "AND-nodes" . In Fig. 2, they appear within a dashed box and 
contain the head of the corresponding clause. Each AND-node has as children 
as many OR-nodes as literals there are in the body. If a child OR-node is al- 
ready in the tree, it is no further expanded and the currently available answer 
is used. For instance, the analysis graph in Figure 2 contains three occurrences 
of the abstract atom tw(B,C) : {B/G,C/V} (modulo renaming), but only one of 
them has been expanded. This is depicted by arrows from the two non-expanded 
occurrences of tw(B,C) : {B/G, C/V} to the expanded one. More information on 
the efficient construction of the analysis graph can be found in [20,9, 1]. 

The program analysis graph is implicitly represented in the algorithm by 
means of two data structures, the answer table {AT) and the dependency table 
iVT). The answer table contains entries of the form A : CP AP which are 
interpreted as the answer (success) pattern for A : CP is AP. For instance, there 
exists an entry of the form main(s^(X), X2) : {X/G,X2/V} ^ {X/G,X2/G} associ- 
ated to the atom discussed above. Dependencies indicate direct relations among 
OR-nodes. An OR-node Ap : CPp depends on another OR-node A^ : CPt iff in 
the body of some clause for Ap : CPp there appears the OR-node At : CPt- The 
intuition is that in computing the answer for A p : CPp we have used the answer 
pattern for At : CPt- In our algorithm we store backwards dependencies,^ i.e., 
for each OR-node At : CPt we keep track of the set of OR-nodes which depend 



* In the implementation, for efficiency, both forward and backward dependencies axe 
stored. We do not include them in the algorithm for simplicity of the presentation. 
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Algorithm 2 Abstract Interpretation with Specialized Definitions 



1: procedure abs_int_with_spec_defs(P, {Ai : CPi, . . . ,An : CP„}) 

2: Create_Table(^T); Create_Table(l?T) 

3: Create_Table(£?T); Create_Table(<ST) 

4: for j = l..n do 

5: PROCESS_CALL_PATTERN(ylj : CPj, {Aj : CPj =^ [Aj : CPj],j, entry)) 

6: function process_CALl_pattern(^ : CP, Parent) 

7: CPi ^ Widen_Call{AT, A : CP) 

8: if not \s\n{AT, A : CPi) then 
9: lnsert(^T,A : CPi,_L) 

10: \nsen{VT,A:CPi,9) 

11: {A',A'i) ^ specialized_definition(P, A : CPi) 

12: A" ^ren{A,{A'/A[}) 

13: for all rcn. apart clause Ck = Hk *— Bk € P s.t. Hk unifies with A" do 

14: CPk ^ Atranslate(A" : CPi, Ck) 

15: process_clause(^ : CPi ^ [Hk ■■ CPk] Bk,k, 1) 

16: Deps ^ Look_up(r'T, A : CPi) \J{Parent} 

17: lnsert(CT, A : CPi , Deps) 

18: return Look_up(^T, ^ : CPi) 

19: procedure PR,OCESS_CLAUSE(i/ -.CP ^ [Hk : CPi] B,k,i) 

20: if CPi ^ ± then 

21: if B = (L, i?) then 

22: CP2 ^ Arestrict(CPi, L) 

23: APo <- process_call_pattern(L : CP2. {H -.CP ^ [Hk : CPi], k, i)) 

24: CPs ^ Aconj(CPi, Aextend(APo, CPi)) 

25: PROCESS_CLAUSE(_ff : CP [Hk : CP3] R,k,i + 1) 

26' else 

27! CP2 ^ Arestrict(CPi, S) 

28: APo <- process_call_pattern(P : CP2, {H -.CP^ [Hk : CPi], A;, i}) 

29: CPs ^ Aconj(CPi,Aextend(^Po,CPi)) 

30: APi <- Atranslate(J7fc : CP3, <- trwe) 

31: ylP2 ^ Look_up(^T, _H" : CP) 

32: ylPs ^ Alub(APi,4P2) 

33: if ylP2 / APi then 

34: lnsert(^T, H : CP, AP.^) 

35: Deps ^ Look_up(I?r, H : CP) 

36: PROCESS_UPDATE(7?eps) 

37: procedure PROCESS_UPDATE(?7pdates) 

38: if Updates — {Ai, .... A„} with n > then 

39: Ai = (H -.CP ^ [Hk : CPi], k, i) 

40: if i ^ entry then 

41: S ^ get_body(P, k, i) 

42: REMOVE_PREVIOUS_DEPS(i/ : CP ^ [i/fc : CPi] B,k,i) 

43: PROCESS_OLAi:sE(i/ : CP ^ f/ffr : CPi] P. k. i) 

44: PU()C'ESs_ri'DATE((;p(/«f(:.s - {.U}) 



on it. That is to say, the keys in the dependency table are OR-nodes and the 
information associated to each node is the set of other nodes which depend on it, 
together with some additional information required to iterate when an answer is 
modified (updated). Each element of a dependency set for an atom B : CP2 is of 
the form {H : CP => [Hf. : CPi] k, i). It should be interpreted as follows: the OR- 
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node H : CP through the Hteral at position k, i depends on the OR- node B : CP2. 
Also, the remaining information [Hk : CPi] informs that the head of this clause is 
Hk and the substitution (in terms of all variables of clause k) just before the call 
to B : CP2 is CPi. Such information avoids reprocessing atoms in the clause k 
to the left of position i. For instance, the dependency set for f (C, A) : {A/V, C/G} is 
{(main(s3(X),X2) : {X/G,X2/V}^ [main(s*(B), A) : {B/G, A/V, C/G}]2, 2}}. It indi- 
cates that the OR-node f (C, A) : {A/V,C/G} is only used, via literal (2,2), in the 
0R-nodcmain(s^(X),X2) : {X/G,X2/V} (sec Example 1). Thus, if the answer pat- 
tern for f (C, A) : {A/V, C/G} is ever updated, then we must reprocess the OR-node 
{main(s2(X),X2) : {X/G,X2/V} from position 2,2. 

Algorithm 2 proceeds as follows. The procedure absjnt_with_spec_defs 
initializes the four tables used by the algorithm and calls process_CALL_pattern 
for each abstract atom in the initial set. The function PROCESS_CALL_pattern 
applies, first of all (L7), the Widen_Call function to A : CP taking into account 
the set of entries already in AT. This returns a substitution CPi s.t. CP C CPi. 
The most precise Widen-Call function possible is the identity function, but it 
can only be used with abstract domains with a finite number of abstract values. 
This is the case with sharing-freeness and thus we will use the identity function 
in our example. If the call pattern A : CPi has not been processed before, it 
places (L9) ± as initial answer in AT for A : CP and sets to empty (LIO) the set 
of OR-nodes in the graph which depend on A : CPi. It then computes (Lll) a 
specialized definition for A : CP\ . We do not show in Algorithm 2 the definition 
of SPECIALIZED_DEFINITI0N, since it is identical to that in Algorithm 1. In the 
graph, we show within an oval box the calls to specialized_definition which 
appear during the execution of the running example (see the details in Sect. 4). 
The clauses in the specialized definition are linked to the box with a dotted 
arc. Then it launches (L13-15) calls to pr,OCESS_GLAUSE for the clauses in the 
specialized definition w.r.t. which A : CPi is to be analyzed. Only after this, the 
Parent OR-node is added (L16-17) to the dependency set for A : CPi. 

The function PROCESS_CLAUSE performs the success propagation and con- 
stitutes the core of the analysis. First, the current answer (^Po) for the call 
to the literal at position k,i of the form B : CP2 is (L24 and L29) conjoined 
(Aconj), after being extended (Aextend) to all variables in the clause, with the 
description CPi from the program point immediately before B in order to ob- 
tain the description CP3 for the program point after B. If B is not the last 
literal, CP3 is taken as the (improved) calling pattern to process the next lit- 
eral in the clause in the recursive call (L25). This corresponds to Icft-to-right 
success propagation and is marked in Fig. 2 with a dashed horizontal arrow. If 
we are actually processing the last literal, CP3 is (L30) adapted (Atranslate) to 
the initial call pattern H : CP which started process_CLAUSE, obtaining APi. 
This value is (L32) disjoined (Alub) with the current answer, AP2, for H : CP 
as given by Look_up. If the answer changes, then its dependencies, which are 
readily available in VT, need to be recomputed (L36) using process_UPDATE. 
This procedure restarts the processing of all body postfixes which depend on 
the calling pattern for which the answer has been updated by launching new 
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calls to PROCESS-CLAUSE. There is no need of recomputing answers in our ex- 
ample. The procedure REM0VE_PREVI0US_DEPS eliminates (L42) entries in VT 
for the clause postfix which is about to be re-computed. We do not present its 
definition here due to lack of space. Note that the new calls to process_clause 
may in turn launch calls to PR0CESS_UPDATE. On termination of the algorithm 
a global fixpoint is guaranteed to have been reached. Note that our algorithm 
also stores in the dependency sets calls from the initial entry points (marked 
with the value entry in L5). These do not need to be reprocessed (L40) but are 
useful for determining the specialized version to use for the initial queries after 
code generation. 

5.1 Termination of Abstract Interpretation with Specialized 
Definitions 

Termination of Algorithm 2 comprises several levels. First, termination of the 
algorithm requires the local termination of the process of obtaining a special- 
ized definition. This corresponds to ensuring termination of function SPECIAL- 
IZED_DEFINITI0N in Algorithm 1. Second, we need to guarantee that the num- 
ber of call patterns for which a specialized definition is computed is finite. This 
corresponds to global termination of specialization algorithms. In terms of our 
algorithm, this is equivalent to having a finite number of entries in ST. The 
AGeneralize function should be able to guarantee it. Third, it is required that 
the set of call patterns for which an answer pattern is to be computed be fi- 
nite. This corresponds to control of multivariance in context-sensitive analysis. 
In terms of our algorithm, this is equivalent to having a finite number of entries 
in AT. The Widen_Call function should be able to guarantee it. Fourth and 
final, it is required that the computation of the answer pattern for each entry in 
AT needs a finite number of iterations. This is guaranteed since we consider do- 
mains which are ascending chain finite. Another way of looking at this problem 
is that, intuitively, the combined effect of terminating AUnfold and AGeneralize 
operators guarantee that the set of specialized definitions which Algorithm 2 will 
compute for an initial set of atoms is finite. These two problems have received 
considerable attention by the PD community (see, e.g., [14]). Since Algorithm 2 
performs analysis of the program composed of the set of specialized definitions, 
once we have guaranteed the finiteness of the program to be analyzed, a termi- 
nating Wideri-Call together with an abstract domain which is ascending chain 
finite guarantee termination of the whole process. 

6 Discussion and Related Work 

We have presented a generic framework for the analysis and specialization of 
logic programs which is currently the basis of the analysis/specialization system 
implemented in the CiaoPP preprocessor. We argue that, in contrast to other 
approaches, the fact that our method can be used both as a specializer and 
analyzer gives us more accuracy and efficiency than the individual techniques. 
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Indeed, the versatility of our framework (and of our implementation) can be 
seen by recasting well-known specialization and analysis frameworks as instances 
in which the different parameters: unfolding rule, widen call rule, abstraction 
operator, and analysis domain, take the following values. 

Polyvariant Abstract Interpretation: Our algorithm can behave as the analy- 
sis algorithm described in [9, 20] for polyvariant static analysis by defining a 
AGeneralize operator which returns always the base form of an expression (i.e., 
it loses all constants) and an AUnfold operator which performs a single deriva- 
tion step (i.e., it returns the original definition). Thus, the resulting framework 
would always produce a residual program which coincides with the original one 
and can be analyzed with any abstract domain of interest. 

Multivariant Abstract Specialization: The specialization power of the framework 
described in [22,21] can be obtained by using the same AGeneralize described 
in the above point plus an AUnfold operator which always performs a derive 
step followed by zero or more abstract execution steps. It is interesting to note 
that in the original framework abstract executability is performed as an offline 
optimization phase while it is performed online in our framework. 

Glassical Partial Deduction: Our method can be used to perform classical PD in 
the style of [19, 8] by using an abstract domain with the single abstract value T 
and the identity function as Widen_Call rule. This corresponds to the VD domain 
of [13] in which an atom with variables represents all its instances. Let us note 
that, in spite of the fact that the algorithm follows a left-to-right computation 
flow, the process of generating specialized definitions (as discussed in Section 3) 
can perform non-leftmost unfolding steps and achieve optimizations as powerful 
as in PD. 

Abstract Partial Deduction: Several approaches have been proposed which ex- 
tend PD by using abstract substitutions [12, 6, 15, 13]. In essence, such approaches 
are very similar to the abstract partial deduction with call propagation shown 
in Algorithm 1. Though all those proposals identify the need of propagating 
success substitutions, they either fail to do so or propose means for propagating 
success information which are not fully integrated with the APD algorithm and, 
in our opinion, do not fit in as nicely as the use of and-or trees. Also, these 
proposals are either strongly coupled to a particular (downward closed) abstract 
domain, i.e.. regular types, as in [6, 15] or do not provide the exact description 
of operations on the abstract domain which are needed by the framework, other 
than general correctness criteria [12,13]. However, the latter allow conjunctive 
PD, which is not available in our framework. 

The approach in [23]: was a starting step towards our current framework. There, 
the introduction of unfolding steps directly in the and or graph was proposed in 
order to achieve transformations as powerful as those of PD while at the same 
time propagating abstract information. In contrast, we now resort to augmented 
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SLD semantics for the specialization side of the framework while using AND- 
OR semantics for the analysis side of the framework. This has both conceptual, 
the hybrid approach we propose provides satisfactory answers to the four issues 
raised in Section 1, and practical advantages, since the important body of work 
in control of PD is directly applicable to the specialization side of our framework. 

7 Conclusions 

Wc have proposed a novel scheme for a seamless integration of the techniques of 
abstract interpretation and partial deduction. Our scheme is parametric w.r.t. 
the abstract domain and the control issues which guide the partial deduction 
process. Existing proposals for the integration use abstract interpretation as a 
means for improving partial evaluation rather than as a goal, at the same level 
as producing a specialized program. This implies that, as a result, their objective 
is to yield a set of atoms which determines a partial evaluation rather than to 
compute a safe approximation of its success. Unlike them, a main objective of 
our work is to improve success information by analyzing the specialized code, 
rather than the original one. Wc achieve this objective by smoothly interleaving 
both techniques which improves success information — even for abstract domains 
which are not related directly to partial evaluation. Moreover, with more accu- 
rate success information, wc can improve further the quality of partial evaluation. 
The overall method thus yields not only a specialized program but also a safe 
approximation of its behaviour. 
Acknowledgments 

The authors would like to thank John Gallagher and Michael Leuschel for useful dis- 
cussions on the integration of abstract interpretation and partial deduction. This work 
was funded in part by the Information Society Technologies programme of the Euro- 
pean Commission, Future and Emerging Technologies under the IST-2001-38059 ASAP 
project and by the Spanish Ministry of Science and Education under the MCYT TIC 
2002-0055 GUBICO project. Manuel Hermenegildo is also supported by the Prince of 
Asturias Chair in Information Science and Technology at UNM.2 

References 

1. M. Bruynooghe. A Practical Framework for the Abstract Interpretation of Logic 
Programs. Journal of Logic Programming, 10:91-124, 1991. 

2. C. Consel and S.C Koo. Parameterized partial deduction. ACM Transactions on 
Programming Languages and Systems, 15(3):463-493, July 1993. 

3. P. Cousot and R. Cousot. Abstract Interpretation: a Unified Lattice Model for 
Static Analysis of Programs by Construction or Approximation of Fixpoints. In 
Proc. ofPOPL'77, pages 238-252, 1977. 

4. P. Cousot and R. Cousot. Systematic Design of Program Transformation Frame- 
works by Abstract Interpretation. In Proc. of POPL'02, pages 178-190. ACM, 
2002. 

5. J. Gallagher, M. Codish, and E. Shapiro. Specialisation of Prolog and FCP Pro- 
grams Using Abstract Interpretation. New Generation Computing, 6(2-3):159-186, 
1988. 



Generic Framework for Analysis and Specialization of Logic Programs 



17 



6. J. P. Gallagher and J. C. Peralta. Regular tree languages as an abstract domain in 
program specialisation. Higher Order and Symbolic Computation, 14(2,3) :143-172, 
2001. 

7. J. P. Gallagher. Static Analysis for Logic Program Specialization. In Workshop on 

Static Analysis WSA '92, pages 285 294, 1992. 

8. J. P. Gallagher. Tutorial on specialisation of logic programs. In Proc. of PEPM'93, 
pages 88-98. ACM Press, 1993. 

9. M. Hermenegildo, G. Puebla, K. Marriott, and P. Stuckey. Incremental Analysis of 
Constraint Logic Programs. ACM Transactions on Programming Languages and 
Systems, 22(2):187-223, March 2000. 

10. N. D. Jones. Combining Abstract Interpretation and Partial Evaluation. In Static 
Analysis Symposium, number 1140 in LNCS, pages 396-405. Springer- Verlag, 1997. 

11. N.D. Jones, C.K. Gomard, and P. Sestoft. Partial Evaluation and Automatic Pro- 
gram Generation. Prentice Hall, New York, 1993. 

12. M. Leuschel. Program Specialisation and Abstract Interpretation Reconciled. In 
Joint International Conference and Symposium on Logic Programming, June 1998. 

13. M. Leuschel. A framework for the integration of partial evaluation and abstract 
interpretation of logic programs. ACM Transactions on Programming Languages 
and Systems, 26(3) :413 - 463, May 2004. 

14. M. Leuschel and M. Bruynooghe. Logic program specialisation through partial de- 
duction: Control issues. Theory and Practice of Logic Programming, 2(4 & 5):461- 
515, July & September 2002. 

15. M. Leuschel and S. Gruner. Abstract conjunctive partial deduction using regular 
types and its application to model checking. In Proc. of LOPSTR, number 2372 
in LNCS. Springer, 2001. 

16. M. Leuschel, J. J0rgensen, W. Vanhoof, and M. Bruynooghe. Offline specialisation 
in Prolog using a hand-written compiler generator. Theory and Practice of Logic 
Programming, 4(1):139-191, 2004. 

17. Michael Leuschel and De Sclireye. Logic program specialisation; How to be more 
specific. In Proc. of PLILP'96, LNCS 1140, pages 137-151, 1996. 

18. J.W. Lloyd. Foundations of Logic Programming. Springer, second, extended edi- 
tion, 1987. 

19. J.W. Lloyd and J.C. Shepherdson. Partial Evaluation in Logic Programming. 
Journal of Logic Programming, ll(3-4):217-242, 1991. 

20. G. Puebla and M. Hermenegildo. Optimized Algorithms for the Incremental Anal- 
ysis of Logic Programs. In Proc. of S AS '96, pages 270-284. Springer LNCS 1145, 
1996. 

21. G. Puebla and M. Hermenegildo. Abstract Multiple Specialization and its Ap- 
plication to Program Parallelization. J. of Logic Programming., 41(2&3):279-316, 
November 1999. 

22. G. Puebla and M. Hermenegildo. Abstract Specialization and its Applications. In 
Proc. of PEPM'03, pages 29-43. ACM Press, 2003. Invited talk. 

23. G. Puebla, M. Hermenegildo, and J. Gallagher. An Integration of Partial Eval- 
uation in a Generic Abstract Interpretation Framework. In Proc. of PEPM'99, 
number NS-99-1 in BRISC Series, pages 75-85. University of Aarhus, Denmark, 
1999. 



